The Enumeration of Information 
Symbols in BCH Codes 

By E. R. BERLEKAMP 

(Manuscript received May 9, 1967) 

This -paper presents certain formulas for I(q, n, d), the number of 
information symbols in the q-ary Bose-Chaudhuri-Hocquenghem code of 
block length n = q m — 1 and designed distance d. By appropriate ma- 
nipulations on the m-digit q-ary representation of d, we derive a simple 
linear recurrence for a sequence whose mth term is the number of informa- 
tion symbols in the BCH code. 

In addition to an exact solution of all finite cases, we obtain exact as- 
ymptotic results, as n and d go to infinity while their ratio n/d remains 
fixed. In this limit, the number of information symbols increases as n'. 
Specifically, we show that for fixed u, ^ u ^ 1, 

Urn q~""I(q, q m — i,uq m ) = 1, 

m-»oo 

where s is a singular function of u. The function s(u) is continuous and 
monotonia nonincreasing; it has derivative zero almost everywhere. Yet 
s(0) = 1 and s(l) = 0. For g = 2, s(u) is plotted in Fig. 1. 

Any cyclic code of block length n over GF(q) may be defined by its 
generator polynomial, g(x), which is some factor of x n — 1 over GF(q), 
or by its check polynomial, h(x) = (x n — l)/g(x). The number of 
check digits in the code is given by the degree of g(x); the number of 
information digits, by the degree of h(x). We assume that n and q 
are relatively prime. It is most convenient to work in a particular ex- 
tension field of GF(q), namely GF(q m ), where m is the multiplicative 
order of q mod n. In this field, x n — 1 factors into distinct linear factors: 

x" - 1 = fl(.r -a). 

v = i 

Here a is any primitive nth root of unity in GF(q m ); a* — 1. From the 
factorization x" - 1 = g(x)h(x), we see that every power of a is a root 
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Fig. 1 — Graph of s(u) vs. u. 



of either g(x) or of h(x), but not both. Thus, a cyclic code partitions 
the powers of a into two sets: those powers which are roots of the 
generator polynomial, and those powers which are roots of the check 
polynomial. If g(x) and h(x) were permitted to have coefficients in 
GF(q m ), then any partition of the powers of a would define a cyclic 
code. However, the coefficients of g{x) and h(x) must lie in the ground 
field GF(q). Consequently, if a is a root of g(x), then so are the con- 
jugates of a, namely a", a"', a"*, • • • . Conversely, if all conjugates 
of roots of g{x) are also roots of g(x), and all conjugates of roots of 
h(x) are also roots of h(x), then all of the coefficients of the polynomials 
g (x) and h(x) lie in GF(q). 

The previous remarks hold for all cyclic codes. 

A 5-ary BCH code of block length n over GF(q) may be defined as 
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the cyclic code whose generator's roots include only a, a, • ■ • , a" 
and their conjugates. This code is capable of correcting any combina- 
tion of less than d/2 errors; (c/. Berlekamp 1 ) the minimum Hamming 
distance of this code is at least d. For this reason, d is called the designed 
distance of the code. 

The first result on the number of information symbols in BCH codes 
is the following lemma: 

Classical Lemma I: Let I(q, n, d) be the number of information symbols 
in the q-ary BCH code of block length n and designed distance d. 

Define [~i~| by the equations 

i = [f\ mod n and 1 ^ [i~\ ^ n. 

Then I(q, n, d) is the number of integers j, such that 1 ^ ; ^ n and[jq l ~\ ^ d 
for all k. 

Proof: a 1 is a root of the generator polynomial of the BCH code iff 
there exists some k(j) such that \~jq k ~\ < d. Conversely, a' is a root 
of the check polynomial iff [jq k ~\ ^ d for all fc. Q.E.D. 

The classical lemma enables one to compute the number of informa- 
tion symbols in any g-ary given BCH code without doing any cal- 
culations in GF{q) or its extensions. One need only enumerate certain 
types of residue classes mod n. In practice, this enumeration is still 
often tedious, particularly when n and d are large. 

In order to obtain more tractable results for large n and d, we prefer 
to start from an alternate form of the classical lemma: 

Classical Lemma II: Let I(q, 11, d) be the number of information symbols 
in the q-ary BCH code of block length n and designed distance d. 

Define \_i] by the equations 

i = [_ij mod n and ^ [_ij ^ n — 1. 

Then, I(q, n, d) is the number of integers i, such that ^ i ^ n — 1 
and L?2*J < n + \ - d for all k. 

Proof: 1 g j S n and [>/~l ^ d for all k iff ^ (n - j) ^ n - 1 
and [(» - iV/J %n - d for all k. Let i = n - j. Q.E.D. 

In the wide sense, BCH codes may be defined over any alphabet 
whose order, q, is a prime power, and for any block length, n, which 
is relatively prime to q. In the narrow sense, however, n is required to 
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be one less than a power of g. For narrow sense codes, the smallest 
extension field of GF(q) which contains the nth roots of unity is GF(n+l). 
For wide sense codes, this extension field is always larger, usually 
much larger. Since the decoder must perform certain computations 
in this extension field, narrow sense BCH codes are more easily im- 
plemented than their more general wide sense counterparts. 

We shall enumerate the information symbols in narrow sense BCH 
codes by reducing the problem to the enumeration of certain kinds of 
sequences over the alphabet consisting of the integers 0, 1, 2, • • • , q — 1, 
as first suggested by Mann. 2 We begin by defining the appropriate 
manipulations with such sequences. 

We shall always use capital letters for sequences. We let (Q — 1) 
denote the sequence consisting of the single letter q — 1. Unless other- 
wise stated, we allow every sequence to be either finite or infinite. 

Let V = ViV 2 V 3 • • • be any finite or infinite g-ary sequence (i.e., 
a sequence of numbers V { , where V { is an integer, S» Vi ^ q — 1. 
We let V = ViV 2 V 3 • • • denote the complement of V, defined by V ( = 
(q — 1) — Vi for all i. HW = W X W 2 ••• W k is a finite g-ary sequence, 
then we may form the cyclic shifts of W: W 2 W 3 • • • W k Wi , W2W4 
• ■ • W k W l W 2 , • ■ ■ • If X is a finite g-ary sequence, X = X X X 2 ■ ■ ■ 
Xj , then we may form the concatenation X * V = XiX 2 X 3 • ■ • 
XjViV 2 V 3 • • • . This concatenation may be formed whenever V is 
a finite or infinite g-ary sequence. If V is a finite g-ary sequence, then 
V * X is a cyclic shift of X * V. 

The g-ary sequence Y is said to be a prefix of X iff X = Y * Z for 
some Z; Y is said to be a suffix of X iff X = Z * Y for some Z. A prefix 
must be a finite (or empty) sequence; a suffix may be empty, finite, 
or infinite. V is a proper prefix of X iff X = V * Z, and neither V nor 
Z is empty. Z is a proper suffix of X iff X = V * Z and neither V nor 
Z is empty. If X is a finite g-ary sequence, X = XiX 2 • • • X k , then we 
may form the iterated concatenation of X with itself, X = X a X 2 • • • 
X k XiX 2 • • • X k • • • . In particular, (Q — 1) denotes the infinite g-ary 
sequence all of whose letters are q — 1. 

We say X < Y iff there exists a ; such that X,- = F, for i = 1, 2, • • • , 
;' - 1, but X, < Yi . If X < Y and Y < X, then one is a prefix of 
the other. 

This ordering is similar to the numerical ordering of g-ary fractions, 
but there are important differences. For example, \ = 0.01 < 0.0101 — 
5/16, but the sequences 01 and 0101 are incomparable, because one is 
a prefix of the other. On the other hand, 0.0111111 • • • = 0.1 = \, 
yet 01111 • • • < 1. This type of example may be excluded by writing 
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all fractions in their terminating form if they have one. We may then 
assert the following: 

Let 

U = Z U,q-\ V - E V t q-\ U = UtUnUt ■ ■ • 

and V = l'iTol'3 • • • , and suppose that (Q — 1) is not a suffix of U or V. 
Then 

U < V =» u < v 

U < V 



u < v <=> 



or 
[U is a prefix of V. 



We say that X is an immediate subordinate of F iff X is a finite se- 
quence, X = X : X 2 X 3 • • ■ X k , and X Y = Yj , X 2 = Y 2 , - • • , X k -i = 
Yk-i} but X k < Y k . The sequence Y has Y x immediate subordinates 
of length 1, Y 2 immediate subordinates of length 2, Y 3 immediate 
subordinates of length 3, • • • Y k immediate subordinates of length k. 
If the sequence Y has only a finite number of nonzeros, then we may 
define the greatest immediate subordinate of Y. If the last nonzero in 
the sequence Y = Y , F 2 • • • is Y k , then the greatest immediate sub- 
ordinate of Y is Y,Y 2 ■ ■ • F*_j(F t — 1). If the sequence Y contains 
an infinite number of nonzeros, then Y has infinitely many immediate 
subordinates. All of them are less than Y itself, but none of them is 
the greatest immediate subordinate. 

Similarly, we say that Y is an immediate superior of X iff Y = FiF 2 F 3 
• • • Y k , where F, = X x , F 2 = X 2 , • • • , F t _, = X k . r but Y k > X k . 
If X = X X X 2 • • • X k and X k ?± (Q — 1), then the least immediate 
superior of X is F - F^ • • • F» ; F, = X, for ** = 1, 2, ■•-,&- 1, 
and F* = X k + 1. It should be evident that the least immediate superior 
is among the longest immediate superiors, and the greatest immediate 
subordinate is among the longest immediate subordinates. 

Definition: If q is any integer, U is any infinite g-ary sequence and 
m is any integer, we define J(q, U, wi) to be the number of g-ary m- 
tuples all of whose cyclic shifts are less than U. 

Lemma III: (Complemented form of Mann's Lemma) 

// 

n = q m - 1, n + 1 - d = Jl U t q m ~\ ^ U, < q, 



1866 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1067 

U = Ui U 2 • • • U„ , and Y is any q-ary sequence then 
I(q,n,d) = J(q, U* Y,m). 

Proof: Lemma III reduces to Lemma II under the following cor- 
respondence: The g-ary m-tuple U corresponds to the integer n + 
1 — d; another g-ary m-tuple W = W Y W 2 • • • W m corresponds to the 
integer w = Z"-i W i q m ~ i . The first cyclic shift of W is the sequence 
W 2 W 3 • • • TF m TFi , which then corresponds to the integer 

£ TV + l - + IF, = qw - (q m - 1)TF, . 
i = i 

Modulo n = q m — 1, the integer corresponding to the first cyclic shift 
of W is seen to be congruent to qw. Therefore, the successive cyclic 
shifts of an m-digit q-ary sequence W correspond to the integers [_wj, 
\_wq_\, \_wq'^, ■ •• , L^9"' -1 J- These integers are all <n + 1 — d iff all 
cyclic shifts of IF are < U, which is true iff all cyclic shifts of W are 
< U * Y, for any Y. Q.E.D. 

The choice Y = U has an interesting interpretation: 

t &«- = EI t/,<r (i+mt) = (z u iq -)/{\ - q ~n 

(d- 1) 
n 



= Ef/,r'¥-i) = i - 



Thus, the sequence U is the 5-ary expansion of 1 — (d — l)/n. For 
this reason, we may investigate the behavior of I(q, n, d) for large n 
and d with a fixed fractional error correction capability, (d — l)/2n, 
by studying J(q, U, m) as a function of m for fixed q and U. 

We shall temporarily ignore the periodicity of the U sequence, 
and consider the function J(q, V, m) for an arbitrary g-ary sequence 
V. We assume only that the sequence V has no terminal zeros. 

From the definition of the immediate subordinates of V, it is clear 
that if an m-digit q-ary sequence W is less than V, then some immediate 
subordinate of V is a prefix of W. For if W is less than V, then there 
exists a k such that W { = V t for i = 1, 2, • • • , k - 1, but W k < V k , 
and the sequence WxW 2 • • • W h is a prefix of W and an immediate 
subordinate of V. 

Now suppose that some m-digit sequence IF and all of its cyclic 
shifts are less than V. Since IF itself is less than V, some prefix of W 
must be an immediate subordinate of V. Are all possible immediate 
subordinates of V possible prefixes of IT 7 ? In general, they are not, for 
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some immediate subordinates may have suffixes which are greater 
than V. If X * Y is an immediate subordinate of V and Y is greater 
than V, then X * Y cannot be a prefix of W. For, UW = X*Y*Z, 
then one of the cyclic shifts of W is Y * Z * X which is greater than V. 

For example, consider the ternary sequence V = 20212. Its im- 
mediate subordinates are 0, 1, 200, 201, 2020, 20210, and 20211. The 
immediate subordinate 20210 has the suffix 210 which is greater than 
V. Therefore, if 20210 is the prefix of TF, then the second cyclic left 
shift of W is greater than V. Similarly, V's immediate subordinate 
20211 has the suffix 211, which is also greater than V. 

For some sequences V, this difficulty does not arise. If V exceeds 
all of its own proper suffixes, then we have the following theorem: 

Theorem 1: Let V be a q-ary sequence which exceeds all of its own 
proper suffixes. Then: 

(i) No immediate subordinate of V is a proper prefix of any other 
immediate subordinate of V. 

(ii) Every suffix of every immediate subordinate of V is a concatena- 
tion of other immediate subordinates of V. 

(Hi) If W and all of its cyclic shifts are less than V, then W can be 
uniquely decomposed into a concatenation of immediate subordinates of V, 
including a (possibly empty) end-around immediate subordinate. Spe- 
cifically W = W il) * TF ,2) * • • - * W w * TF (<+1) * W (i+2) * ■■■* W u ~ l) * 
W U) \ W a \ W m , "- , W u ' l) are immediate subordinates of V; W U) * 
1V ll) * W lZ) * • • • * W w is the end-around immediate subordinate. The 
end-around immediate subordinate has a prefix, IT ', which is a suffix 
of W, and a suffix W m * W m * • ■ • * W m which is a prefix of W, as 
well as a concatenation of the shorter immediate superiors W , W (2) , 
■■■ , W i{) . 

(iv) Every concatenation of immediate subordinates of V, including 
a (possibly empty) end-around immediate subordinate yields a sequence 
which has the property that all of its cyclic shifts are less than V. No 
such sequence of length m can exceed the maximum m-digit concatenation 
of immediate subordinates of V. If Y is the maximum m-digit concatena- 
tion of immediate subordinates of V, and Y ^ U ^ V, then J(q, V, m) = 
J(q, U, m). 

(v) 

711-1 

J(q, V, m) = mV m + £ V k J(q, V, m - k), 
where F,- is taken as if j exceeds the length of the sequence V. 
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(vi) Let 

n = 3 » - l, d = 2 D t q m "', £ D t < q, 
D = D,D 2 ■■■ D m . 

if 
V * (Q - 1) < D £ 

least m-digit concatenation of immediate superiors of V, 

then, 

I(q,n,d) = J(q, V, m). 

Proofs: 

({) This property of immediate subordinates does not even depend 
on the suffix condition on V. From the definition of immediate sub- 
ordinates, each immediate subordinate must disagree with V only 
in the immediate subordinate's last digit, and hence no immediate 
subordinate can be a prefix of any other. 

(ii) Let us first prove the weaker assertion: 

(a) Every proper suffix of every immediate subordinate of V has 
a prefix which is a shorter immediate subordinate of V. 

Let S be an immediate subordinate of V, and let »S <2) be a suffix 
of S. We may write S = S a) * S {2 \ Since S differs from V only in its 
last digit, S m is a prefix of V, and V = S lli * V w . Since 8 < V, 
g<2) < yw^ g ince y excee( j s a ii of its own proper suffixes, 7 (2) < V. 

Therefore, & (2) < V. Therefore, some prefix of S (2) is an immediate 
subordinate of V. 

(b) If every suffix of an immediate subordinate has a prefix which 
is an immediate subordinate, then every suffix of every immediate 
subordinate is a concatenation of immediate subordinates. 

For, suppose F is a suffix on an immediate subordinate, then F = 
B ll) * F (2) , where B w is an immediate subordinate. Since F (2> is a suffix 
of F, it is also a suffix of an immediate subordinate, and F <2) = Z? (2) * F (3> , 
where B <2) is an immediate subordinate • • • F = B (1) * B (2) * B * • • • . 

(Hi) Since W < V, it contains a prefix TF (1) which is an immediate 
subordinate of V. After shifting this prefix around to the end, we may 
similarly identify W i2 \ W l3) , ■■• , W u ~ 1 \ each of which is an im- 
mediate subordinate of V. The sequence W U) * W w * W i2) * ■■■ * 
W li ~ l) is a cyclic shift of W, and so it must have a prefix, P, which 
is an immediate subordinate of V. P is not a prefix of W U) , so W l,) 
must be a prefix of P. Suppose that W U) * W a} * • • • * W w is a prefix 
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of P, but that W (i) * W w * ■■■ * TF ( ' +I) is not a prefix of P. (This 
defines i.) Then P = W in * W w * • • • * W w * S, where the (possibly 
empty) sequence S is a proper prefix of W <t+1) . Since S is a suffix of 
P, which is an immediate subordinate of V, S is itself a concatenation 
of immediate subordinates of V. But no immediate subordinate of V 
is a proper prefix of any other immediate subordinate of V, so S must 
be empty. 

(iv) This is the converse of (tit). Suppose we are given the sequence 
W = S in * W (1) * TF (2) * ••• * IF"" 1 ' * P U) , where W ll) , W w , ■ • • , 
W u ~ l) and W m = P U) * S li) are immediate subordinates of V. We must 
show that all cyclic shifts of W are less than V. Any cyclic shift is of the 
form C = S a) * W lh+1) * W ik+2) * ■■■ * W U) * W w * W m * • • • * 

JP<* + 1) * p(*) > where W W = p(« * S ™ m Jf gW Jg empt y j C hag the 

prefix W lk + l) , which is an immediate subordinate of V. If S lk) is not 
empty, by (it) it has a prefix which is an immediate subordinate of V, 
which is a prefix of C. In either case, C has a prefix which is an im- 
mediate subordinate of V. Therefore, C < V. 

(v) V has V m immediate subordinates of length m, each of which 
has m distinct cyclic shifts. Thus, W may be chosen as a single end- 
around immediate subordinate of V in mV m ways. 

If W is a concatenation of several immediate subordinates of V, 
W = W (l) * W w * ■■■ * W u ~ u * W U) where W a \ W l2) , • • • , W U ~ X) 
are immediate subordinates of V and W ll) is a (possibly empty) proper 
prefix of the immediate subordinate W U) * W n) * • • • * W w t then the 
length of W is the length of PT""" plus the length of W {1) * W m * • • • * 
W u ~ 2) * W u \ For each k, there are V k choices of W u ~ l) of length k, 
and J(q, V, m - k) choices for W a) * W m * ••■ * W u ~ 2) * W u \ 

(vi) Least special case: Suppose D is the least m-digit g-ary sequence 
greater than V * (Q — 1). Letting 

d = HDiq""', v = S7,g n ~ , > v = 27/"', 

it is evident that d + v = n + 1 and w = n + 1 — d- According to 
Lemma III, 7(g, ti, rf) = J((/, V, m). 

(vi) Greatest special case: Let D be the least w-digit concatenation 
of immediate superiors of 1*. Complementing, D is the greatest m-digit 
concatenation of immediate subordinates of V. In the notation of 
part (iv), D = Y. Letting d = Z7),g'" _ ', d = n — d. Letting n + 1 — 
d = EUiq m ~ , C/ > r because ?i + 1 — d > ?t — d. Theorem follows 
from part (iv) and Lemma III. 

(vi) The general case follows because J(q, U, m) is a monotonic 
function of U. Q.E.D. 



1 870 
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Example I: 


Let V be the binary sequence 


1101. We compute 






Bose distancef 


Designed distance 


m J(q, V, 


m) Binary Decimal 


Binary 


Decimal 


1 1 


1 1 


1 


1 


2 3 


01 1 


01 


1 


3 4 


Oil 3 


010 


2 


4 11 


0011 3t 


001 it 


3 


5 16 


00111 7 


00110 


6 


G 30 


001101 13 


001100 


12 


7 50 


0011011 27 


0011000 


24 


8 91 


00110011 51 


00110000 


48 


9 157 


001100111 103 


001100000 


96 


10 278 


0011001101 205 


0011000000 


192 


11 485 


00110011011 411 


00110000000 


384 


12 854 


001100110011 819 


001100000000 


768 



Here J(q, V, m) is computed by Theorem lv. The designed distances 
are computed according to Theorem Ivi, using V - 0010, with im- 
mediate superiors 1, 01, and 0011. V * (Q - 1) = 001011111111 

Evidently, the binary BCH code of block length 2 12 - 1 and designed 
distance 768 is identical to the binaiy BCH code of block length 2 12 - 1 
and designed distance 769 or 770 or • • • or 819. This code has 854 
information symbols. This code is distinct from the binary BCH code 
of block length 2 12 — 1 and designed distance 820. This is true in 
general, because the least ra-digit concatenation of immediate superiors 
of V is necessarily minimum among all of its own cyclic shifts. This 
"greatest designed distance" is called the Bose distance. 

It happens that the binary BCH code of block length 2 12 - 1 and 
designed distance 768 is also distinct from the binary BCH code of 
block length 2 12 — 1 and designed distance 767, because the 12-digit 
binaiy expansion of 767 is minimum among all of its cyclic shifts. 
This, however, need not be true in general. For example, the binaiy 
BCH code of block length 2 4 - 1 and designed distance 3 is not distinct 
from the binary BCH code of block length 2 4 - 1 and designed distance 
2, because the 4-digit binaiy expansion of 2 = 0010 is not minimum 
among its cyclic shifts; the minimum is 0001. 

t Defined later in the text. 

JThis code is identical to the binary BCH code of block length 15 and de- 
signed distance 2. 
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In general, we would like to determine the number of information 
digits in the g-ary BCH code of block length n — q m — 1 and designed 
distance d = 2D,g m ~\ The previous theorem gives us a solution to 
this problem if we can find a sequence V which is greater than all of 
its own suffixes and has the property that 

V * (Q - 1) < D ^ 

least wi-digit concatenation of immediate superiors of V. 

Complementing this condition gives 

V > D ^ 

greatest m-digit concatenation of immediate subordinates of V. 
or 

V > D* (Q - 1) > 

greatest ra-digit concatenation^ 

vof immediate subordinates of VJ * 6 > X, 

where X is the greatest immediate subordinate of V. We may assume 
that V has no terminal zeros, and that the length of V does not exceed 
the length of D. Since X and V have the same length, X is a prefix of D. 
Since V is the least immediate superior of X, the problem of finding 
V is reduced to the problem of finding X, which is a prefix of D. The 
solution is as follows: 

Theorem 2: Let X be the shortest prefix of D such that 

D = X * F, F * (Q - 1) ^ D * (Q - 1), 

and let V be the least immediate superior of X. Then 

(i) V * (Q - 1) < D ^ 

least m-digit concatenation of immediate superiors of V. 
(ii) V exceeds all of its own proper suffixes. 
Proof of {i): 

Since X is a prefix of D and V is an immediate superior of X, V 
is an immediate superior of D. So V > D and V > D * (Q — 1). 
Complementing gives V * {Q — 1) < D * 6, so V * (Q - 1) < D. 



Let X™ = X * X * ■ • • * X. Then F * (Q - 1) ^ D * (Q - 1) is 

equivalent to X™ * F * (Q - 1) > X (1) * F * (Q - 1). Therefore, 
X*X' 0, *F*(Q- 1) %X*X ll) *F*(Q- l)orX (1) * F * {Q -1 ) ^ 
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X (2) * F * (Q - 1). By induction, X (k) * F * (Q - 1) ^ X (t+1> * F * 
(Q - 1) and D * (Q - 1) ^ X (i) * F * (Q - 1) for aU fc. Since this is 
true for arbitrarily large k, D* (Q — 1) ^ X. Complementing, D*6 ^ 

X ^ any infinite concatenation of immediate superiors of V. There- 
fore, D ^ any ra-digit concatenation of immediate superiors of V. 

Proof of (it): 

Let X = Y * Z * L, where Y and Z are arbitrary (possibly empty) 
and L is the final digit of X. We have 

V = Y * Z * (L + 1) 

D = Y * Z * L* F 

F * (Q - 1) ^ Y * Z * L * F * (Q - 1) = X * F * (Q - 1) ; 

X * F * (Q - 1) - F * Z * L * F * (Q - 1) > Z * L * F * (Q - 1), 
else Y would be a shorter prefix than X which satisfied the same condi- 
tions. 

No proper suffix of V can equal V, for the suffix must be shorter. 

If some proper prefix of V, say Z * (L + 1), (Z possibly empty) 
exceeds V, then 

Z * (L + 1) > Y * Z* (L + 1) > Y * Z * L = X. 

If Z * L > X, then Z * L * F * (Q - 1) > X * F * (Q - 1), a con- 
tradiction. If Z * L is a prefix of X, then X = Z * L* G and from 

X * F * (Q - 1) > Z * L * F * (Q - 1) 

we have 

Z*L*G*F*(Q-1)>Z*L*F*(Q-1) 

G * F * (Q - 1) > F * (Q - 1) ^ X * F * (Q - 1). 

Now Z * L is a shorter prefix than X, a contradiction. Therefore, 
Z * (L + 1) < Y * Z * (L + 1), i-e., V exceeds all of its own proper 
suffixes. Q.E.D. 

Example II: Let q = 9, n = 728 = 9 3 - 1, d = 217. Then D = 261, 
D = 627, D * 8 = 627888 • • • , X = 62, V = 63, V = 25. 

Bose distance 
q-ary decimal 

3 3 

26 24 

263 219 

t In this and subsequent tables we use the single later J as an abbreviation 
for J(q, V, to). 



m 


J\ 


1 


6 


2 


42 


3 


270 
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Example III: Let q = 2, n = 511, d = 185. Then D = 010111001, 
D = 101000110, D * 1 = 101000110111111 • • • , X = 101000, V = 
101001, V = 010110. Immediate superiors are 010111, 011, 1. 

m J Bose distance Smaller designed distance 



1 


1 


1 


1 


2 


1 


11 


11 


3 


4 


011 


Oil 


4 


5 


0111 


0111 


5 


6 


01111 


01111 


r> 


10 


010111 


010111 


7 


22 


0101111 


0101110 


8 


29 


01011111 


01011100 


9 


49 


010111011 


010111000 



Example IV: Let q = 2, n = 511 = 2 9 - 1, d = 187. Then D = 
010111011, D = 101000100 = X, V = 101000101, V = 010111010. 

m J Bose distance 



1 


1 


1 


2 


1 


11 


3 


4 


Oil 


4 


5 


0111 


5 


G 


01111 


6 


10 


011011 


7 


22 


0101111 


8 


29 


01011111 


9 


49 


010111011 



The answer agrees with Example III, although the recurrence is 
different. This illustrates the general nonuniqueness of V. Theorem 
2 specifies one satisfactory method of finding V, but as seen from 
this example, this V need not be unique. The simplest recurrence 
rule generally arises from the shortest possible V, winch corresponds 
to the greatest V, or the least D. This can generally be found by first 
reducing D insofar as permissible. 

Example V: Let n = 2" - 1, d = 411. Then D = 00110011011. 
We could take D = 11001100100 = X and proceed. However, we 
instead consider d = 410, D = 00110011010. Since D has a cyclic 
shift smaller than itself, the code is unchanged. But D = 11001100101, 
X = 1100110010 does not look much easier, so we continue. Each 
prime marks the starting point of a smaller cyclic shift. 
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D 

oonoonoi'o 

OOllOOll'OOl 

001 10011 '000 

ooirooioiu 

0011'OOIOHO 



0011 '0010000 
OOll'OOOllll 



0011 '0000000 
00101111111 

Since 00101111111 has no cyclic shift less than itself, this designed 
distance is the Bose distance of a different BCH code. We must instead 
use D = 00110000000, D = 11001111111, X = 1100, V = 1101. The 
recurrence is given in Example I: 1(2, 2 11 — 1,411) = 485. This 
same V is obtained if we started with D = 00110011000, or any D 
in the region 

00110000000 ^ D ^ 00110011000 

Example VI: Let q = 2, n = 2 16 - 1, D = 001010010100111, D * i = 
1101011010110001111- ■ -, X= 110101101011000, 7 = 110101101011001, 
V = 001010010100110 



m 


/ 


Bose distance 


1 


1 


1 


2 


3 


01 


3 


4 


011 


4 


11 


0011 


5 


16 


00111 


(3 


36 


001011 


7 


64 


0010101 


S 


115 


00101011 


9 


211 


001010011 


10 


378 


0010100111 


11 


694 


00101001011 


12 


1256 


001010010101 


13 


2276 


0010100101011 


14 


4112 


00101001010111 


15 


7474 


001010010100111 
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Although the brute force method just used gives the right answer, a 
more devious approach proves easier. Instead of D = 001010010100111, 
let us consider D = 001010010100101, X = 11010, V = 11011, V = 
00100. This yields a different set of codes, with a much simpler re- 



currence: 










m 


J 


Bose distance 




1 


1 


1 




2 


3 


01 




3 


4 


011 




4 


11 


0011 




5 


21 


00101 




6 


36 


001011 




7 


64 


0010101 




S 


115 


00101011 




9 


211 


001010011 




10 


383 


0010100101 




11 


694 


00101001011 




12 


1256 


001010010101 




13 


2276 


0010100101011 




14 


4126 


00101001010011 




15 


7479 


001010010100101 



The code with D = 001010010100111 has 5 less information digits 
than the code with D = 001010010100101, corresponding to the 5 
distinct cyclic shifts of 001010010100101. 



I. ASYMPTOTIC RESULTS 

Let us define the enumerator 



J(q, U;z) = Yl ^(?. U, ^K 1 - 

m = l 

Given a sequence V which is less than all of its own proper suffixes, 
we may also define 



so that 



V(z) = Z>*2*, 

k 

zV'(z) = Z kVS. 



The recurrence 

J(q, V, m) = mV m + £ V k J(q, V, m - k) 
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becomes 

J(g,V;z) = zV'(z) + V(z)J(q,V;z) 
whose solution is 

zV'(z) 



J(q, V;z) = 



1 - V(z) 



Let pi , p 2 , • • • be the (not necessarily distinct) complex reciprocal 
roots of 1 - V(z). Then 

1 " V(z) = II (1 " pA 

- v(z) = - z p.. n a - PiZ) 



tt*i 



zV'(z) v^ PiZ v-> v t \« V 1 V ™ "■ 

Therefore, 

«/(?, V;*) - EE pV 

m = l «' 

SO 

/(<?, v, m) = E P r , 

where p,- are the complex numbers defined by the equation 
1 ~ F« = 11(1- P.2). 

Although this gives an explicit expression for J(q, V, m), the expression 
depends upon the complex numbers p, . For finite values of m, it is 
usually easier to compute J (q, V, m) directly from the recurrence 
relation of the previous section, since these calculations involve only 
integers. For asymptotic results, however, the above equation is very 
useful. 

Definition: Let p = max | p< |, let s = log a p. 

i 

Since all coefficients of the polynomial V (2) must be nonnegative 
integers not exceeding g — 1, it is easily seen that the p, with the 
maximum absolute value is real and positive, and 1 ^ p ^ q. Clearly, 

J(q, V, m) f=si p m 
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for large m, in the sense that 

lim p~ m J{q, V, m) = 1. 

Similarly, 

log, J(q, V, m) W m log, p = ?hs. 
If 

• = 1 

and 

x ^ u ^ v * 6, 

where V exceeds all of its own suffixes and X is the maximum sub- 
ordinate of V, then 

I(q, q" - 1, uq") « q"". 

In other words, if we fix the fraction d/n = u and let n and d grow 
large, then 



I pan' 



or, more precisely, 



wi 

For given 5, the function s(w) is a rather complicated animal. To 
compute it, one must first write u in 5-ary. If U exceeds all of its proper 
suffixes, set V = U; otherwise write U = X * F where X is the shortest 
prefix such that U S F. V is then taken to be the least immediate 
superior of X. Then, s is defined as the logarithm (base q) of the maxi- 
mum reciprocal root of 1 — V(z). 

It may easily be shown that s is a continuous, monotonic nonincreas- 
ing function of it. It may also be shown that the derivative of s with 
respect to u is either or it is undefined. There are two kinds of points 
at which the derivative s'(u) is undefined. First, there are the end- 
points of the intervals on which s(u) is constant, u is a lower endpoint 
of such an interval iff U is a finite sequence which exceeds all of its 
own proper suffixes; u is an upper endpoint of such an interval iff U 
is a periodic sequence, equal to some of its suffixes but not less than 
any others. At these endpoints, s(w) is undifferentiate because it has 
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only a right derivative or a left derivative, but not both. There is 
only a countable number of points of this type. 

The more interesting points are those at which s(u) has neither a 
right derivative nor a left derivative. This happens iff U is an infinite 
sequence which exceeds all of its own proper suffixes, and 6 is not a 
suffix of U. 

The set of points u such that s(u) is not differentiate is uncountable, 
but it has measure 0. Professor T. Pitcher of the University of Southern 
California has also shown 3 that this set has Hausdorf dimension 1. 
This appears to be entirely due to the large density of these points 
in the vicinity of u = 0. In general, I conjecture that the Hausdorf 
dimension of the set of points in the interval a ^ u ^ b [where ^ a, 
6^1, s{a) t* s(b)] is s(a). In some sense, almost all of the nondif- 
ferentiable points in any interval seem to lie very near the leftmost 
cluster point of the interval. 

When Mann 2 first obtained results identical to those here in the 
special cases u = q~ k , he also showed that p is the only reciprocal 
root of 1 — V(z) with magnitude greater than 1. Thus, not only is 

Itt P m , 

but in fact, for sufficiently large m, 

I = (P m ), 

where (•) denotes the nearest integer to "•". Unfortunately, this 
strengthened result is not true in general. For some values of it, 1 — V(z) 
has only one reciprocal root with magnitude greater than 1, but for 
other values of u, 1 — V(z) has many reciprocal roots with magnitude 
greater than 1. Little is known about the behavior of the smaller 
complex reciprocal roots of 1 — V(z) as a function of u, although 
B. F. Logan 4 has obtained a few preliminary results in this area. 

II. ACTUAL DISTANCE 

As one increases the designed distance, the number of information 
symbols in the resulting code must either remain constant or decrease. 
Thus, 

I(q, n, d) is the maximum number of information symbols in any of the 
q-ary BCH codes with designed distance ^ d. 

We must be careful to distinguish between / and /, defined by 



INFORMATION SYMBOLS IN BCH CODES 1879 

l(q, n, d) is the maximum number of information symbols in any of the 
q-ary BCH codes with actual distance ^ d. 

It is obvious that l(q, n, d) ^ I{q, n, d). 

For example, there are three binary BCH codes of block length 23, 
having 23, 12, and 1 information symbols. The code with 23 informa- 
tion digits has Bose distance = actual distance = 1, but the code 
with 12 information digits has Bose distance 5, actual distance 7. 
The code with 1 information digit has Bose distance == actual distance = 
23. Therefore, 1(2, 23, 6) = 1(2, 23, 7) = 1, but 1(2, 23, 6) = 1(2, 23, 7) = 
12. For all values of d ^ G or 7, 1(2, 23, d) = 1(2, 23, d). 

The known cases in which I(q, n, d) > I(q, n, d) are relatively sparse. 
Peterson, Kasami, and Lin 5 and Berlekamp 6 have investigated this 
question for narrow sense binaiy BCH codes (where q = 2 and n = 
2 m - 1). They proved that 1(2, 2'" - 1, d) = 1(2, 2 m - 1, d) if d divides 
2 m — 1, or if d is one less than a power of 2, or if m' divides m and 
1(2, 2 m ' - 1, d) = 1(2, 2'"' - 1, d) > 1(2, 2'"' - 1, d + 1), or if m is 
sufficiently small, or if d is sufficiently small, or if d and/or m satisfy 
any of various other number theoretical constraints. More recently, 
Peterson and Lin 7 have shown that if 1(2, T - 1, d) = 1(2, 2 m - 1, d) > 
1(2, 2 m - 1, d + 1), and 1 S j ^ m - d then 1(2, 2 m - 1, 2'd + 2'" - 1) 
= 1(2, 2 m — 1, 2'd + 2' — 1). No examples are known in which 1(2, 
2 m — 1, d) > 1(2, 2 m — 1, d), and it has been conjectured that 1(2, 
2 m - 1, d) = 1(2, 2 m - 1, d) for all m and d. 

Although this conjecture remains open, we can obtain certain results 
about the asymptotic behavior of 7(2, 2 m — 1, u2 m ) from the known 
classes of special cases in which 1(2, 2 m — 1, d) = 1(2, 2 m — 1, d). 
We would like to define 

v ^ '• r 1032/(2,2- - l,^2 m ) 

s(u) = hm — 2 — * • 

Unfortunately, however, we have no assurance that the limit exists. 
In order to discuss the asymptotic behavior of the best BCH codes, 
we define 

—, 7. log 2 /(2,2 m - 1,^2-) 

ski) = hm sup — - — * • 

in-* in 

Clearly s(u) ^ s(u). Like s(u), s(u) must be a monotonic nonincreasing 
function of u, because if d' > d, the codewords of the g-ary BCH code 
of distance d' are a subset of the codewords of the g-ary BCH code of 
distance d. 
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We can prove that i(u) = s(u) for certain values of u, as indicated 
by the following theorem: 

If u ^ 2~ k , then s(u) g s(2"*) 

Proof: We know that if u ^ 2~ h , then 

1(2, 2-~\ w2 m ) ^ /(2, 2" - 1, 2 m_fc - 1) 

= 7(2, 2 m - 1, 2 m ~* - 1) m ^ A: 

Hence, 

log 7(2, 2 m - l,M2 m ) < log /(2,2 m - 1, 2 W ~* - 1) 
m = m 

So 



s(w) ^ lim 



log 7(2, 2 m - 1,2-*- 1) = s(2 - A) 



because s(w) is continuous. Q.E.D. 

This shows that $(u) = s(u) if u = |, |, |, ■ • • . Similarly, one can 
show from the recent theorem of Peterson and Lin that s(u) = s(u) 
for certain other values of u. 

We conjecture that s(u) = s(u) for all u. This is a weakened form 
of Peterson's conjecture that 1(2, 2 m - 1, d) = 7(2, 2 m — 1, d) for 
all m and d. 
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